367 research outputs found
Minimum Cost Active Labeling
Labeling a data set completely is important for groundtruth generation. In
this paper, we consider the problem of minimum-cost labeling: classifying all
images in a large data set with a target accuracy bound at minimum dollar cost.
Human labeling can be prohibitive, so we train a classifier to accurately label
part of the data set. However, training the classifier can be expensive too,
particularly with active learning. Our min-cost labeling uses a variant of
active learning to learn a model to predict the optimal training set size for
the classifier that minimizes overall cost, then uses active learning to train
the classifier to maximize the number of samples the classifier can correctly
label. We validate our approach on well-known public data sets such as Fashion,
CIFAR-10, and CIFAR-100. In some cases, our approach has 6X lower overall cost
relative to human labeling, and is always cheaper than the cheapest active
learning strategy
Acidic pH Triggers Lipid Mixing Mediated by Lassa Virus GP
Lassa virus (LASV) is the causative agent of Lassa hemorrhagic fever, a lethal disease endemic to Western Africa. LASV entry is mediated by the viral envelope glycoprotein (GP), a class I membrane fusogen and the sole viral surface antigen. Previous studies have identified components of the LASV entry pathway, including several cellular receptors and the requirement of endosomal acidification for infection. Here, we first demonstrate that incubation at a physiological temperature and pH consistent with the late endosome is sufficient to render pseudovirions, bearing LASV GP, non-infectious. Antibody binding indicates that this loss of infectivity is due to a conformational change in GP. Finally, we developed a single-particle fluorescence assay to directly visualize individual pseudovirions undergoing LASV GP-mediated lipid mixing with a supported planar bilayer. We report that exposure to endosomal pH at a physiologic temperature is sufficient to trigger GP-mediated lipid mixing. Furthermore, while a cellular receptor is not necessary to trigger lipid mixing, the presence of lysosomal-associated membrane protein 1 (LAMP1) increases the kinetics of lipid mixing at an endosomal pH. Furthermore, we find that LAMP1 permits robust lipid mixing under less acidic conditions than in its absence. These findings clarify our understanding of LASV GP-mediated fusion and the role of LAMP1 binding
On Achieving Diversity in the Presence of Outliers in Participatory Camera Sensor Networks
This paper addresses the problem of collection and
delivery of a representative subset of pictures, in participatory camera networks, to maximize coverage when a significant portion of the pictures may be redundant or irrelevant. Consider, for example, a rescue mission where volunteers and survivors of a large-scale disaster scout a wide area to capture pictures of
damage in distressed neighborhoods, using handheld cameras, and report them to a rescue station. In this participatory camera network, a significant amount of pictures may be redundant (i.e., similar pictures may be reported by many) or irrelevant (i.e., may
not document an event of interest). Given this pool of pictures, we aim to build a protocol to store and deliver a smaller subset of pictures, among all those taken, that minimizes redundancy and eliminates irrelevant objects and outliers. While previous work addressed removal of redundancy alone, doing so in the presence of outliers is tricky, because outliers, by their very nature, are different from other objects, causing redundancy minimizing algorithms to favor their inclusion, which is at odds with the goal of finding a representative subset. To eliminate both outliers and redundancy at the same time, two seemingly opposite objectives must be met together. The contribution of this
paper lies in a new prioritization technique (and its in-network
implementation) that minimizes redundancy among delivered
pictures, while also reducing outliers.unpublishedis peer reviewe
Multi-dimensional range queries in sensor networks
In many sensor networks, data or events are named by attributes. Many of these attributes have scalar values, so one natural way to query events of interest is to use a multi-dimensional range query. An example is: "List all events whose temperature lies between 50° and 60°, and whose light levels lie between 10 and 15." Such queries are useful for correlating events occurring within the network.In this paper, we describe the design of a distributed index that scalably supports multi-dimensional range queries. Our distributed index for multi-dimensional data (or DIM) uses a novel geographic embedding of a classical index data structure, and is built upon the GPSR geographic routing algorithm. Our analysis reveals that, under reasonable assumptions about query distributions, DIMs scale quite well with network size (both insertion and query costs scale as O(√N)). In detailed simulations, we show that in practice, the insertion and query costs of other alternatives are sometimes an order of magnitude more than the costs of DIMs, even for moderately sized network. Finally, experiments on a small scale testbed validate the feasibility of DIMs
- …